ABSTRACT


Western Pacific tropical cyclone position forecast errors for 10 
years (1966-1975) are statistically analyzed. Variations of errors 
versus a dozen parameters are examined and the trends over the 10 years 
are discussed. Discriminant analysis techniques were used to isolate 
categories where forecasts were likely to be above and below the median 
in East-West and North-South error components. The discriminant analysis 
was tested on 1976 data and the results are presented. It was confirmed 
that a small number of readily available parameters, such as location,
maximum winds, and speed of movement, can, with reasonable effectiveness,
classify a tropical cyclone forecast as belonging to a group with either
markedly above or below average errors.
I. OBJECTIVES


As discussed in a request from the Director, Joint Typhoon Warning 
Center, Guam (JTWC) to the Naval Environmental Prediction Research 
Facility, Monterey (NEPRF), a need exists for a statistical analysis of 
the JTWC tropical cyclone forecasts to discover the existence of any 
significant trends. More specifically, the long range goals number
four:

1. To identify situations where the forecasts are very good or very 
bad, to allow maximum concentration of resources for quick reduction of 
the largest errors. 

2. To provide probability algorithms for an estimate of the forecast
errors of warnings, to assist Western Pacific commanders in operational
decisions regarding the protection and/or evacuation of military
resources.

3. To stratify errors for 24- through 72-hour forecasts, based on
various parameters such as location, time of year, speed of movement, 
intensity, and synoptic patterns. 

4. To determine if the year to year variations in forecast accuracies
for the 10 year period are real, or random deviations about a long
term mean.

The more immediate short range goals of this research, as a first 
step toward the realization of the long range objectives, are: 

1. To check the data for errors in recording, and to test it for
reliability as a data base for statistical study.

2. To assemble and consolidate the data into a usable format. 

3. To determine basic statistical relationships between parameters. 

4. To manipulate the basic data, to create a set of parameters for 
further study of errors in the 24-, 48-, and 72-hour typhoon forecasts. 

5. To perform discriminant and stepwise multiple linear regression 
analyses, to find parameters related to the forecast errors. 

6. To summarize the results and test them by a preliminary applica- 
tion to 1976 data. 

7. To make recommendations as to the direction for continued
research toward the long range goals.
II. PREPARATION OF THE DATA


The JTWC Western Pacific tropical cyclone forecasts and best tracks 
were matched by date/time groups for 10 years as follows: 


1966 - 34 storms
1967 - 35
1968 - 26
1969 - [illegible]
1970 - [illegible]
1971 - 35
1972 - [illegible]
1973 - [illegible]
1974 - 34
1975 - [illegible]

The term "storm" used herein refers collectively to tropical cyclones 
(tropical depressions, tropical storms, and typhoons) without regard to 
intensity. The 10 year total was 290 storms, or an average of 29 storms 
per year. The total number of best track positions at six-hourly 
intervals was approximately 6150. 

In the process of matching forecasts with best tracks of the same
time, the data was checked for errors and corrected, when necessary,
using annual typhoon reports. Rarely was a report garbled beyond correction
so as to require removal from the data. There were some storms,
however, that were so short-lived as to provide no verifying forecasts.
These were not represented in the verified case data that was statistically
analyzed. For instance, to verify a 48-hour forecast and compute
an error distance for the forecast, a best track position must be
available 48 hours later. If, in that 48-hour period, the storm dissipated
and was no longer identified by a best track position, the forecast could
not be verified. This also accounts for the fact that fewer cases were
verified for 72-hour forecasts than for 48-hour, and fewer 48-hour than
24-hour forecasts. These cases numbered as follows:


4809    24-hour forecasts
3038    48-hour forecasts
1372    72-hour forecasts


As a minimum the following parameters were known for each case at
forecast initiation time:

Maximum Wind                                     MAX WND
Latitude                                         LAT
Longitude                                        LONG
West-East Component of Storm Movement            U MOVT
South-North Component of Storm Movement          V MOVT
Position Number on Storm Track                   POS NO
Number of Storms in Progress at Forecast Time    NO STM
Month                                            MONTH
Time-GMT of the Forecast                         TIME
Error Distance (Nautical Miles)                  ERR DIS
Direction from Verifying Position to
  Forecast Position                              ERR DIR


The 1976 data was processed in a similar way, but retained separately
for testing. There were 625 best track positions at six-hourly intervals
from 25 storms and the verifying cases totaled:

524    24-hour forecasts
424    48-hour forecasts
332    72-hour forecasts







III. STATISTICAL ANALYSIS


A. BASIC STATISTICS

As a preliminary consideration, evidence of a climatic trend in the
10 year data was investigated. The number of occurrences of tropical
cyclones in a fixed time (1 year) can be assumed to follow a Poisson
distribution if two plausible conditions exist: (a) an occurrence is just
as likely in one interval as another, and (b) the occurrence of an event
has no effect on whether or not another occurs. A property of this
distribution is that the population variance equals the population
mean. In this 10 year sample, the variance is 29.89, and the mean is
29.00. If a climatic change were occurring, then the sample variance
should appreciably exceed the sample mean. As the two are nearly equal,
no climatic trend is evident within the 10 year sample.
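
The comparison of variance and mean can be sketched as an index-of-dispersion
check. This is only an illustration, not the procedure used in the study: the
chi-square form of the test and the n - 1 divisor for the variance are
assumptions, and the critical value is the standard tabulated 5% point for
9 degrees of freedom.

```python
# Index-of-dispersion check for Poisson-distributed annual storm counts.
# The mean, variance, and sample size are the values quoted in the text.
n_years = 10
sample_mean = 29.00
sample_variance = 29.89

# For Poisson data the variance-to-mean ratio should be close to 1.
dispersion_ratio = sample_variance / sample_mean

# ASSUMPTION: under the Poisson hypothesis, (n - 1) * s^2 / mean is treated
# as approximately chi-square with n - 1 degrees of freedom.
dispersion_statistic = (n_years - 1) * sample_variance / sample_mean

# Upper 5% point of chi-square with 9 degrees of freedom (standard tables).
CHI2_95_DF9 = 16.919

overdispersed = dispersion_statistic > CHI2_95_DF9

print(f"variance/mean ratio  : {dispersion_ratio:.3f}")
print(f"dispersion statistic : {dispersion_statistic:.2f}")
print(f"overdispersed at 5%  : {overdispersed}")
```

A ratio near 1 and a statistic well under the critical value are consistent
with the conclusion that the sample shows no excess variance.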

The initial statistical analysis of the variables employed the UCLA
Biomedical computer program BMD02R stepwise multiple linear regression
(Dixon, 1970). Tables I and II summarize the means, standard deviations,
variance explained, and the correlation matrix of the first 10
variables for the 24-, 48- and 72-hour forecasts. No correlation coefficient
of any available predictor with the magnitude of the errors
(at either 24, 48, or 72 hours) exceeded 0.185, and the total explained
variance of the error distance did not exceed 11%. It was noted, however,
that the variables contributing most to the explained variance were
MAX WND, LAT, LONG, U MOVT, and V MOVT. The concept of predicting the
error directly was abandoned.







From these basic statistics, the average Western Pacific tropical
cyclone moved from 19°N latitude, 135°E longitude to the WNW at seven
knots with maximum winds of about 66 knots in late August.


B. MEAN ERROR STRATIFICATIONS 

For more detail, the forecast errors were stratified and mean errors
were computed for each stratification along the range of each variable.
The distance to the nearest storm (STM DIS), for multiple storm cases,
and initial position error (POS ERR) were added as variables. Significant
trends were evident (Figures 1-6). In the figures, stratifications
were selected to keep the number of cases in each group relatively high.
Frequencies are typically a few hundred and are not indicated except
where they drop below 100. As an aid in interpreting these figures,
relative frequencies, all based on 4809 cases, are shown in Figures 1-6
for the 24-hour forecasts. Since these are percentages of the total,
they roughly apply also for 48- and 72-hour forecasts.
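
The stratification step can be sketched as a simple binning of cases along
one variable. The function name, the record layout, and the sample cases
below are hypothetical and serve only to show the mechanics; they are not
data from the study.

```python
from collections import defaultdict
from statistics import mean


def stratify_mean_error(cases, key, bin_width):
    """Group cases into bins of `bin_width` along `key` and return
    {bin_lower_edge: (count, mean_error)}."""
    bins = defaultdict(list)
    for case in cases:
        lower_edge = (case[key] // bin_width) * bin_width
        bins[lower_edge].append(case["err_dis"])
    return {edge: (len(errs), mean(errs)) for edge, errs in sorted(bins.items())}


# HYPOTHETICAL cases (not from the report): latitude in degrees N,
# forecast error distance in nautical miles.
sample_cases = [
    {"lat": 12.5, "err_dis": 90.0},
    {"lat": 14.0, "err_dis": 110.0},
    {"lat": 22.0, "err_dis": 130.0},
    {"lat": 24.5, "err_dis": 150.0},
    {"lat": 31.0, "err_dis": 200.0},
]

by_latitude = stratify_mean_error(sample_cases, key="lat", bin_width=10)
for edge, (count, avg) in by_latitude.items():
    print(f"{edge:4.0f}-{edge + 10:<4.0f} N  n={count}  mean error={avg:.1f} NMI")
```

Returning the count alongside the mean makes it easy to flag bins that fall
below a chosen minimum number of cases, as the figures do for bins under 100.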

In each of the 24-, 48-, and 72-hour forecast situations, the mean
forecast errors were minimal for lower latitudes (Figure 1a), gradually
increasing with latitude. This indicates that storms are more accurately
forecast before they recurve and move into higher latitudes. It is
also consistent with a study by Sadler (1967), which distinguished
between (a) storms originating in the vicinity of the mean August surface
trough, between 5° and 20°N latitude, that moved mostly to the west; and
(b) those beginning north of 20°N latitude, in the vicinity of the mean
August Tropical Upper Tropospheric Trough (TUTT), which were more erratic
with predominantly northerly components. Both of these concepts support
smaller forecast errors for storms in lower latitudes.

Mean forecast errors decrease with decreasing east longitude (Figure
1b), or more westerly positions. Generally, a forecast for a storm in
a more westerly position is one based on a longer than average history, 
perhaps in an area of better synoptic data coverage, given the proximity 
to the Philippines, Taiwan, and China, and other continental areas west 
of 130°E. Additionally, land radar enhances accuracy of location. 
Storms west of 130°E are less susceptible to large forecast errors when 
land is nearby to the north of the track, because storms unexpectedly 
recurving over China dissipate rapidly and hence are not generally 
reflected in forecast verification. 

Maximum wind (Figure 2) is another important parameter. The mean 
errors decrease with increasing maximum wind speeds, indicating that 
better developed storms, again with longer histories and more accurate 
center locations, are more accurately forecast. This trend is visible 
for all three forecast times, with some increasing fluctuations and 
irregularities in the 48-, and 72-hour forecasts, which are based on 
progressively smaller sample sizes. 

Relative to the U component of storm movement (Figure 3a), forecasts 
are generally better for storms moving west and becoming progressively 
more difficult as westward movement diminishes and becomes eastward as 
associated with recurvature. For the V component (Figure 3b), the best 
forecasts are centered at or near zero, again implying better forecasts 
when the storm is moving west with little or no deflection north or south.
Errors increased markedly for storms moving with any southward component,
or to the north, as might be associated with the recurvature process.

Time of day (Figure 4b) showed no perceivable relationship with forecast
errors at any of the forecast times. For the years 1969-1971
forecasts were issued at 0500 GMT plus every six hours, while in the
remaining years forecasts were issued at 0000 GMT plus every six hours.
For no obvious reason, forecasts in those three years appear to have
been superior to forecasts issued at the more normal synoptic times.
Time of year, or stratification by month (Figure 4a), did show that
larger errors occurred with the largest frequency of storms in late
summer to fall. There is a consistent improvement in April for all three
forecasts, but since this is based on only 5% of the cases, its significance
is somewhat dubious. The factors of workload and personnel turnover
seem to be reflected in Figure 4a. Most personnel changes occur in
the spring to early summer months as the frequency of multiple storms
increases. Mean errors subsequently increase by 20 to 30% in July and
August, and then taper off through the rest of the season, as the workload
stabilizes and as the newcomers gain experience. This trend is
less pronounced in the 48- and 72-hour data, but in light of the case
distribution, the argument is not negated. Support for this argument is
shown by mean errors increasing with the number of storms occurring
simultaneously (Figure 5a). This could be indicative of the aforementioned
added workload on the forecasters, or perhaps due to complicated
multistorm interaction not fully understood. With progressively
fewer cases available as the number of storms increases,
the trend is not strongly supported, however. It is noted that four
storm cases occurred in 1972 only and may reflect the year rather than
the occurrence of four storms. The relationship is reinforced, however,
in light of the larger errors occurring when less than 600 nautical miles
separates two storms. This parameter, the distance to the nearest storm,
is depicted in Figure 5b. It is in agreement with findings by Brand
(1968) that the Fujiwhara effect is not felt beyond 750 nautical miles.
Beyond that distance, mean errors decreased and stabilized. The parameter
of six-hourly point along the track (Figure 6a), a measure of the
length of storm history, showed a trend congruous with that of maximum
wind: as the storm's history and development increased, the forecast
errors decreased. In this case, a minimum mean error occurs late in the
third day of a storm. This stabilizes in the 24-hour situation, but
decays for 48- and 72-hour forecasts, as might be expected with storm
recurvature occurring late in the storm history.

For the last variable, initial position error (Figure 6b), a consistent
and prominent trend shows the mean forecast errors increasing as
the initial position errors increase. This supports the basic forecasting
premise that accurate observations are necessary for accurate forecasts.
This finding is in general agreement with that of Neumann (1975),
who found that for Atlantic hurricanes, the initial position error was
important in objective forecasts with its relative importance decreasing
in longer-range forecasts.


C. ANNUAL VARIATION OF ERRORS 

Figure 7 shows the mean forecast errors for each year of the 10 year
sample, with the least squares linear trend lines. In each case, the
trend line is too shallow to indicate conclusively an improvement during
the 10 years. Correlations are negative but less in magnitude than 0.3
(about one standard deviation from 0.0). While the hypothesis that there
has been no improvement over the 10 years is suspected, it cannot be
rejected. Because these mean error values were computed including all
tropical cyclones, they differ from those published in the annual typhoon
reports, which were based only on storms of typhoon intensity.
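
The remark that a correlation of 0.3 lies about one standard deviation from
zero can be checked with a short sketch. The 1/sqrt(n - 1) approximation for
the standard deviation of r under no true correlation is an assumption about
how that figure was obtained, and the yearly mean errors below are
hypothetical values used only to exercise the trend fit.

```python
import math


def linear_trend(years, values):
    """Return (slope, intercept, r) of the least squares line values ~ years."""
    n = len(years)
    mean_x = sum(years) / n
    mean_y = sum(values) / n
    sxx = sum((x - mean_x) ** 2 for x in years)
    syy = sum((y - mean_y) ** 2 for y in values)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(years, values))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    r = sxy / math.sqrt(sxx * syy)
    return slope, intercept, r


# ASSUMPTION: approximate standard deviation of r about zero for n points.
n_years = 10
r_std_error = 1.0 / math.sqrt(n_years - 1)

# HYPOTHETICAL yearly mean errors (NMI), for illustration only.
years = list(range(1966, 1976))
mean_errors = [130, 125, 120, 128, 118, 122, 131, 115, 124, 119]
slope, intercept, r = linear_trend(years, mean_errors)

print(f"slope = {slope:.2f} NMI/year, r = {r:.2f}")
print(f"std. dev. of r under no trend = {r_std_error:.2f}")
print(f"within two std. dev. of zero  : {abs(r) < 2 * r_std_error}")
```

With 10 yearly values the standard deviation of r is about 0.33, so a
correlation of magnitude 0.3 cannot be distinguished from no trend.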





D. AUTOCORRELATION OF SUCCESSIVE FORECASTS 

To this point, the forecasts (and hence forecast errors) have been
tacitly assumed to be independent of each other. In reality, successive
six-hourly forecasts for a particular storm are strongly correlated.
Table III gives the estimated autocorrelation coefficients between errors
from successive forecasts with lag times out to 36 hours. It is possible
to adjust the number of related cases in a particular storm downward to
an effective number of independent cases by a complicated relationship
given by van der Bijl (1951). The ratio of these two values (the effective
number of independent cases divided by the total cases) decreases
with an increase in the autocorrelation coefficient as well as with increasing
numbers of forecasts per storm, and increases with lag time
between forecasts. For 24-hour forecasts, where typically 15 to 20 forecasts
are made at six-hourly intervals, this ratio is approximately 1/3.
In the 48-hour case where the autocorrelation is higher, but the typical
number of forecasts per storm is lower (10-15), this ratio is 1/4 to
1/3. At 72 hours, the six-hourly autocorrelation coefficient is higher;
however, the forecasts were usually issued at 12-hourly intervals and
the typical number of forecasts was about five per storm, thus increasing
the ratio to 1/3 to 1/2.

This ratio is important in significance testing where the square root
of the number of cases (to be replaced by the effective number of independent
cases) is found in the denominator of the test statistic. Whether
1/4, 1/3, or 1/2 of the number of cases is used as the effective number
makes little difference in the test statistic, so 1/3 times the number of
cases will be used as an arbitrary compromise estimate of the effective
number of independent cases throughout for the purpose of significance
testing.













E. FREQUENCY DISTRIBUTION OF FORECAST ERROR COMPONENTS 

For the purpose of constructing probability ellipses, error components
have been assumed to be distributed according to a Gaussian, or normal,
frequency distribution. Figures 8a and 8b show the observed cumulative
frequency distributions of the West-East (U) and South-North (V) components
of the errors plotted on probability scaled paper, where a normal distribution
would be represented by a straight line. This presentation shows
generally good agreement between the plotted observed and normal curves
(straight lines) computed from estimates of the means and standard deviations.
The maximum differences between the theoretical and observed
cumulative frequencies are:


        U COMPONENT                          V COMPONENT
24HR    4.5%         errors < -70 NMI        [illegible]  errors < 70 NMI
48HR    [illegible]  errors < -70 NMI        3.9%         errors < -30 NMI
72HR    2.2%         errors < -110 NMI       [illegible]  errors < -30 NMI


The Kolmogorov-Smirnov goodness of fit test (Massey, 1951) regards
as significant at the 5% level, differences in observed and theoretical
cumulative frequencies greater than 1.36/√N in absolute value. Using
an effective number of cases of 1/3 (4809) = 1603 at 24 hours, 1/3 (3038)
= 1013 at 48 hours, and 1/3 (1372) = 457 at 72 hours, the cutoff points
for significance would be 3.4% at 24 hours, 4.3% at 48 hours, and 6.4%
at 72 hours. Only the differences between the 24-hour theoretical curves
and the observed plotted values are significant at the 5% level. Estimates
of the third and fourth moments about the mean (Table IV) reveal
that the 24-hour forecast errors are skewed west (forecasts are too far
east) while all other skewness coefficients appear normal. Both components,
however, appear to be leptokurtic. This is evident in Figure 8, where
extreme occurrences fall counterclockwise with respect to the theoretical
lines. This suggests that probability ellipses, based on the assumption
of Gaussian distributions, may be slightly biased in that 24-hour verifying
positions are more likely to fall outside an ellipse on the west side
than the east side, and that inner and outer ellipses may not contain the
proper proportion of the verifying positions. In general the 10% ellipses
would be expected to contain more than 10% of the verifying positions and
the area beyond the 95% ellipse to contain more than 5% of the verifying
positions. It is not possible to make a statement about the intermediate
ellipses between 10 and 95%, but Figure 8 suggests good agreement between
the theoretical and observed cumulative frequencies there.
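
The Kolmogorov-Smirnov cutoff points quoted above follow directly from the
1.36/√N rule applied to the effective number of cases. A minimal sketch,
assuming the effective number is rounded to the nearest whole case (the
function name is illustrative only):

```python
import math

KS_COEFF_5PCT = 1.36  # large-sample 5% coefficient quoted in the text


def ks_cutoff_percent(total_cases, effective_fraction=1 / 3):
    """Return (effective N, 5% Kolmogorov-Smirnov cutoff in percent)."""
    n_eff = round(total_cases * effective_fraction)
    return n_eff, 100.0 * KS_COEFF_5PCT / math.sqrt(n_eff)


# Verified case counts for the 10 year sample, keyed by forecast hour.
verified_cases = {24: 4809, 48: 3038, 72: 1372}
cutoffs = {hour: ks_cutoff_percent(n) for hour, n in verified_cases.items()}

for hour, (n_eff, cutoff) in cutoffs.items():
    print(f"{hour}-hour: effective N = {n_eff}, cutoff = {cutoff:.1f}%")
```

Changing `effective_fraction` to 1/4 or 1/2 shows how the cutoff shifts with
the assumed degree of independence between successive forecasts.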

If probability ellipses, or other estimates of future probable error,
integrated over an area are desired, the observed cumulative distribution
could be used in place of the Gaussian cumulative distribution. The
degree of complexity added by such a step, as well as the uncertainty in
the representativeness of this particular 10 year sample (to the future)
suggest such a step is not warranted.
IV. DISCRIMINANT ANALYSIS


A. CALCULATION OF DISCRIMINANT FUNCTIONS AND GROUP MEANS 

To identify forecasts as either good or bad, discriminant
analysis was used, namely, the UCLA Biomedical computer program BMDP7M
(Dixon, 1974). With this approach, the cases were divided into groups
and classification functions were found that best delineate the groups.
These functions, linear combinations of the variables, would then be
used to predict the classification of new cases. BMDP7M is a stepwise
discriminant analysis which identifies the subset of variables that
maximizes the difference between groups. Variables are entered into
the classification function one at a time until there is no appreciable
improvement in group separation.

For this study, U and V error components were either good, with the 
absolute values of the error less than or equal to the median; or bad, 
with the absolute value of the error greater than the median. The four 
possible combinations were resolved into three classifications: 

GROUP 1: both U and V components good 
GROUP 2: either U or V good 
GROUP 3: both U and V components bad 
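
The grouping rule can be sketched as follows. It is assumed here that each
median is taken over the absolute values of the corresponding error
component; the function name and the sample errors are hypothetical.

```python
from statistics import median


def assign_groups(u_errors, v_errors):
    """Return a list of group numbers (1, 2, or 3), one per forecast case."""
    # ASSUMPTION: medians are computed on absolute component errors.
    u_median = median(abs(u) for u in u_errors)
    v_median = median(abs(v) for v in v_errors)
    groups = []
    for u, v in zip(u_errors, v_errors):
        u_good = abs(u) <= u_median
        v_good = abs(v) <= v_median
        if u_good and v_good:
            groups.append(1)   # both components good
        elif u_good or v_good:
            groups.append(2)   # exactly one component good
        else:
            groups.append(3)   # both components bad
    return groups


# HYPOTHETICAL error components in nautical miles.
u_errors = [-20.0, 150.0, 40.0, -90.0, 10.0]
v_errors = [15.0, -120.0, 100.0, 30.0, -5.0]
groups = assign_groups(u_errors, v_errors)
print(groups)
```
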
The six variables contributing to the separation of groups included all
those selected by the linear regression:

1. Latitude                                         LAT
2. Longitude                                        LONG
3. Maximum Wind                                     MAX WND
4. West-East Component of Movement                  U MOVT
5. South-North Component of Movement                V MOVT
6. Number of Storms in Progress at Forecast Time    NO STM

Their means and standard deviations are given in Table V.

Previously established trends are consistently apparent in the data.
Group 1 forecasts are associated with lower latitudes, more westerly
longitudes, faster westerly movement, minimal N-S movement, the more
intense storms (typhoons), and fewer concurrent storms.


B. TESTING OF THE DISCRIMINANT FUNCTIONS AND GROUP MEANS 

Each set of forecast cases was then tested by applying the group 
means and classification functions calculated from the BMDP7M program, 
using the 24-hour data. The classification functions derived from the
24-hour forecasts were also applied to 48- and 72-hour forecasts. This 
resulted in slight loss of discrimination at 48 hours, but has the 
advantage of classifying a forecast into the same category for forecasts 
at all time intervals. 

Using the six resulting coefficients (c_1, ..., c_6) and a constant
(c_7) for each group, operating on the six selected variables (X_1, ..., X_6),
three functions (f) were evaluated, one for each group, thusly:

    f_i = \sum_{j=1}^{6} c_{ij} X_j + c_{i7},    i = 1, 2, 3



Each function value was subtracted from its corresponding component of
the group means, and the differences were squared and summed to represent
a vector distance from each group mean. Each case was then assigned to
one of three groups according to which vector distance was minimal. The
cases so sorted were counted, and the means and standard deviations of
the error components and error magnitudes were calculated. These means
and standard deviations are given in Table VI. The standard deviations
of the components and the mean vector errors reflect the differences in
the groups, with the component means mostly near zero. A Student's t
test was applied to determine if the mean vector errors of each group
were significantly different from those of the other groups. At the 5%
level of significance for a one tail test, t = 1.645. Therefore, if
the value of t between two groups exceeds that figure, the groups are
deemed to be significantly different. Values of t computed on the group
means are given in Table VII. It should be noted that the number of cases
was reduced by a factor of 3 to account for autocorrelation as previously
discussed.
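
The assignment step described above can be sketched as a nearest-mean rule in
the space of the three function values. All coefficients, constants, group
mean vectors, and cases below are hypothetical placeholders chosen for
readability; the actual values come from the BMDP7M output and are not
reproduced here.

```python
def classification_scores(x, coefficients, constants):
    """Evaluate f_i = sum_j c_ij * x_j + c_i7 for each of the three groups."""
    return [
        sum(c * value for c, value in zip(row, x)) + const
        for row, const in zip(coefficients, constants)
    ]


def assign_group(x, coefficients, constants, group_means):
    """Return the 1-based group whose mean score vector is nearest the case."""
    scores = classification_scores(x, coefficients, constants)
    distances = [
        sum((m - s) ** 2 for m, s in zip(mean_vector, scores))
        for mean_vector in group_means
    ]
    return distances.index(min(distances)) + 1


# HYPOTHETICAL coefficients, one row per group; variable order is
# LAT, LONG, MAX WND, U MOVT, V MOVT, NO STM.
COEFFICIENTS = [
    [-0.10, 0.0, 0.01, 0.0, 0.0, 0.0],
    [0.00, 0.0, 0.00, 0.0, 0.0, 0.0],
    [0.10, 0.0, -0.01, 0.0, 0.0, 0.0],
]
CONSTANTS = [0.0, 0.0, 0.0]          # hypothetical c_i7 values
GROUP_MEANS = [                      # hypothetical mean (f_1, f_2, f_3) per group
    [-0.5, 0.0, 0.5],
    [-1.5, 0.0, 1.5],
    [-2.5, 0.0, 2.5],
]

low_latitude_typhoon = [12.0, 125.0, 90.0, -10.0, 1.0, 1.0]
recurving_depression = [30.0, 145.0, 35.0, 5.0, 10.0, 3.0]

group_a = assign_group(low_latitude_typhoon, COEFFICIENTS, CONSTANTS, GROUP_MEANS)
group_b = assign_group(recurving_depression, COEFFICIENTS, CONSTANTS, GROUP_MEANS)
print(group_a, group_b)
```

Because the rule needs only the six variables known at forecast time, a case
can be classified before its verifying position is available.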

The mean errors of Group 1 were found to be significantly less than
those of Group 2, and those of Group 2 were significantly less than those
of Group 3, except for the 72-hour data. There the mean V components of
the errors were significantly non-zero and of different signs, giving
unique spatial error distributions for the two groups with only slightly
different mean absolute errors.
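
The effect of reducing the number of cases by a factor of 3 on such a test
can be sketched as below. The unpooled form of the statistic is an
assumption, since the exact form used is not stated, and the group means,
standard deviations, and counts are hypothetical.

```python
import math

T_CRITICAL_ONE_TAIL_5PCT = 1.645  # value quoted in the text


def t_statistic(mean_a, sd_a, n_a, mean_b, sd_b, n_b, effective_fraction=1 / 3):
    """Two-sample t for mean_b - mean_a using effective sample sizes."""
    n_a_eff = n_a * effective_fraction
    n_b_eff = n_b * effective_fraction
    # ASSUMPTION: unpooled standard error of the difference of means.
    standard_error = math.sqrt(sd_a ** 2 / n_a_eff + sd_b ** 2 / n_b_eff)
    return (mean_b - mean_a) / standard_error


# HYPOTHETICAL group statistics (mean and std. dev. in NMI, case counts).
t_adjusted = t_statistic(100.0, 60.0, 1800, 130.0, 80.0, 1500)
t_unadjusted = t_statistic(100.0, 60.0, 1800, 130.0, 80.0, 1500,
                           effective_fraction=1.0)

print(f"t with N/3 cases : {t_adjusted:.2f}")
print(f"t with all cases : {t_unadjusted:.2f}")
print(f"significant at 5%: {t_adjusted > T_CRITICAL_ONE_TAIL_5PCT}")
```

Dividing the case count by 3 shrinks t by a factor of √3, so a difference
must be correspondingly larger to be judged significant.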


C. GROUP ANNUAL VARIATIONS 

Mean forecast errors per year per group, and all groups combined, are
shown in Figure 9 with trend lines. Again the difference between Groups 1
and 3 is substantial. The Group 2 average errors most closely approximate
the pattern of all three groups combined. Year to year fluctuations are
extremely large for Group 2, and somewhat less for Groups 1 and 3. The
larger fluctuations in the Group 2 means and in the mean errors of longer-
range forecasts in all groups are mostly attributable to relatively
smaller sample sizes.
D. PROBABILITY ELLIPSE COMPUTATIONS

Having thus far separated forecasts into one of three groups, it was
now desirable to present this information in a more graphical way, the
goal being a useful operational application. Following previously established
methods (Stevens and Palmer, 1963), the probability ellipse is
such an application. Assume that errors in forecast position approximate
a bivariate normal distribution (Section III.E). The expression for an
ellipse is given by:

    (x^2 - 2rxy + y^2) / (1 - r^2) = c^2

with the normalized error components x = (U - \bar{U})/s_U and y = (V - \bar{V})/s_V;
U is the E-W error component; V is the N-S error component. \bar{U}, \bar{V}, s_U,
and s_V are the estimates of the respective means and standard deviations;
and r is the estimated correlation coefficient between U and V.

    Probability = 1 - e^{-c^2/2}

c = 1 approximates a 40% ellipse. Figure 10
shows the 40% probability ellipses for each group at each forecast interval.
Distance dimensions are nautical miles, areas are in thousands of square 
nautical miles, and directions are in degrees north of east. A 40% 
probability ellipse means that a forecast position has a 40% probability 
of falling within its corresponding ellipse. Only the difference in size 
between Groups 1 and 3 is immediately obvious, but upon comparison of the 
areas, the distinctions are more pronounced. The 24-hour area of Group 3 
is more than double that of Group 1, with the 48- and 72-hour areas being 
97 and 57% larger, respectively. 
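
For a bivariate normal distribution the probability enclosed by an ellipse of
scale c is 1 - exp(-c²/2), which is why c = 1 gives roughly a 40% ellipse.
The sketch below evaluates that relation, its inverse, and the enclosed area.
The function names are illustrative and the area formula is the standard
result for this quadratic form, not a quotation from the report.

```python
import math


def ellipse_probability(c):
    """Probability content of the ellipse with scale c: 1 - exp(-c^2 / 2)."""
    return 1.0 - math.exp(-0.5 * c * c)


def ellipse_scale(probability):
    """Inverse of ellipse_probability: the c giving the requested content."""
    return math.sqrt(-2.0 * math.log(1.0 - probability))


def ellipse_area(c, s_u, s_v, r):
    """Area enclosed by the ellipse, in squared units of s_u and s_v."""
    return math.pi * c * c * s_u * s_v * math.sqrt(1.0 - r * r)


p_unit = ellipse_probability(1.0)
print(f"c = 1 encloses {100 * p_unit:.1f}% of cases")
for p in (0.25, 0.50, 0.75, 0.90, 0.95):
    print(f"{int(100 * p)}% ellipse: c = {ellipse_scale(p):.3f}")
```

Since the area scales with c², s_U, and s_V, the ratio of areas between
groups at a fixed probability depends only on their standard deviations and
correlation.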


The general NW orientation of the major axis of the ellipse indicates
that for low latitude storms on a normal WNW track, errors are comprised
of nearly equal components along the track (speed error), and across the
track (track error). These storms are usually associated with Groups 1 
and 2. Recurving and post-recurvature storms are almost always in Groups 
2 and 3, with Group 3 predominating as storms are entrapped by the 
westerlies. During recurvature, when the track is nearly north, track 
error is dominant, whereas after recurvature, speed error dominates. 

Using too large an ellipse (such as Group 2 for a Group 1 case) tends 
to dilute and spread the estimated probability density. This has the 
effect of overwarning those customers far removed from the forecast track, 
and underwarning those along the track. Conversely, using too small an 
ellipse (such as Group 2 for a Group 3 case) has the effect of overwarn- 
ing those along the forecast track and underwarning those in the periphery. 
This case is the meteorologist's familiar dilemma when forecasts are 
taken too literally without adequate allowance for errors. Tailoring 
the ellipses to the expected forecasting difficulty has the effect of 


reducing both overwarning and underwarning. 
V. TESTING OF INDEPENDENT DATA (1976)


The final step in this research was to apply the same procedures to
an independent data set and compare the results.


A. DISTRIBUTION OF ERRORS 

The 1976 forecasts and best tracks, having been processed like the
10 year sample, were analyzed using the same discriminant functions and
group means to similarly arrive at three separated groups. The 1976 group
statistics are listed in Table VIII. The contrast between group statistics
is not as sharp as in the 10 year dependent data sample (Table VI).
The means vary more widely as compared to the dependent data, while the
smaller standard deviations of Group 2 show the 1976 forecasts to appear
significantly better for those cases.

So the question arises as to whether errors of 1976 are representative
of the 10 year data sample. Statistically each group vector mean
of the 1976 data was compared with its counterpart in the dependent data
sample. Table IX lists the values of the normal test statistic, Z,
found for each group, to be compared with the 5% level of significance
value of Z = 1.96. From this, Groups 1 and 3 of 1976 cannot be rejected as
having come from the sample of the previous 10 years, but Group 2 forecasts
appear to be significantly better than the preceding 10 year average.

On the other hand, Figure 9 shows Group 2 to have a wide annual variation,
partially because of smaller frequency of occurrence. The relatively
high number of Group 2 cases in 1976 may include only a few storms
which can negate any significance in the above differences.
B. ELLIPSE TESTING

Forecast errors of 1976 were tested to determine the percentages of
verifying positions that would fall within ellipses with probabilities
specified at 25, 50, 75, 90, and 95%. These results appear in Figure 11.
For Group 1, the observed closely follows the 45° expected line with the
maximum deviation of observed from expected being 10% at 72 hours.

Group 2 deviations were consistently conservative (above the 45° line)
with deviations up to 18%. For Group 3 the 72-hour deviation was the
greatest at 20%, also conservative. None of these differences are significant
at the 5% level by the Kolmogorov-Smirnov goodness of fit test.
Generally, the ellipses were too conservative. This is to be expected
since a trend of decreasing errors was evident in the 10 year data.
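
The counting involved in such a test can be sketched as follows, assuming the
usual bivariate normal quadratic form with threshold c² = -2 ln(1 - P). The
function name, the group parameters, and the error pairs are hypothetical.

```python
import math


def fraction_inside(errors, mean_u, mean_v, s_u, s_v, r, probability):
    """Fraction of (u, v) error pairs inside the ellipse of given content."""
    c_squared = -2.0 * math.log(1.0 - probability)
    inside = 0
    for u, v in errors:
        x = (u - mean_u) / s_u
        y = (v - mean_v) / s_v
        q = (x * x - 2.0 * r * x * y + y * y) / (1.0 - r * r)
        if q <= c_squared:
            inside += 1
    return inside / len(errors)


# HYPOTHETICAL verifying errors (U, V) in nautical miles and group statistics.
errors_1976 = [(0.0, 0.0), (50.0, 0.0), (0.0, 200.0), (300.0, 300.0)]
observed = {
    p: fraction_inside(errors_1976, 0.0, 0.0, 100.0, 100.0, 0.0, p)
    for p in (0.25, 0.50, 0.75, 0.90, 0.95)
}
for expected, found in observed.items():
    print(f"expected {100 * expected:.0f}%  observed {100 * found:.0f}%")
```

Plotting observed against expected percentages gives the kind of comparison
with the 45° line described above; points above the line indicate ellipses
that are too large.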

For dramatic comparison, the same ellipse testing was repeated, but
counting the number of Group 3 cases that fell into the smaller Group 1
ellipses and the number of Group 1 cases that fell into the larger Group 3
ellipses (Figure 12). The comparison shows the Group 3 ellipses to be
ultra-conservative for Group 1 cases. Conversely, fewer Group 3 observed
cases fell into Group 1 ellipses, by roughly the same percentages below
the expected as the other situation was above. This contrast shows that
significant differences exist in the distribution of errors from forecasts
specified in advance to be either Group 1 or Group 3.
VI. CONCLUSIONS


In light of the objectives outlined for this study, to some extent
the long-range goals have been attained. It has been demonstrated that 
a small number of readily available parameters can, with reasonable 
effectiveness, classify a tropical cyclone forecast as likely resulting 
in either markedly above or below average errors. Group 1 forecasts 
have a high probability of below average errors with a low probability 
of above average errors. Group 2 forecasts have approximately equal 
probabilities of being above or below average. Group 3 forecasts have 
a low probability of below average errors with a high probability of 
above average errors. 

The concept of using least squares regression to predict in advance
the actual error (as opposed to a class of errors) appears to offer
little chance of meaningful success. It is apparent that it is possible
to isolate conditions contributing to forecast errors in the mean, but
one must bear in mind that excellent forecasts are occasionally made
under the worst conditions, and conversely, terrible forecasts can be
made under the best of conditions.
VII. RECOMMENDATIONS


The examination of two additional parameters could improve the 
delineation between classifications. First, initial position error, 
shown to be directly related to the forecast errors, was not intro- 
duced as a discriminator because it is not generally known to the 
forecaster at the time of the forecast. 

Second, the synoptic pattern associated with each storm has not
been considered. Some parameter which accounts for the relative
locations of semi-permanent features, such as the TUTT, subtropical
ridges, and perhaps transient troughs in the westerlies, might prove
to be a most important discriminator, especially as it relates to the
track forecast errors and the basic problem of forecasting the
recurvature of tropical cyclones.

While these results are to be considered preliminary, pending 
improvements and refinements, dissemination to typhoon forecast 
subscribers is recommended. 
TABLE I. BASIC STATISTICS - RESULTS OF 
STEPWISE MULTIPLE LINEAR REGRESSIONS 

                                     STANDARD     VARIANCE EXPLAINED (%) 
VARIABLE                MEAN         DEVIATION    24-HR        48-HR        72-HR 

MAX WND (kts)           [illegible]  [illegible]  3.4          1.4          0.4 
LAT (°N)                [illegible]  [illegible]  4.2          [illegible]  [illegible] 
LONG (°E)               [illegible]  14.6         [illegible]  [illegible]  [illegible] 
U MOVT (kts)            [illegible]  6.4          [illegible]  [illegible]  [illegible] 
V MOVT (kts)            4.0          [illegible]  [illegible]  [illegible]  [illegible] 
POS. NO.                [illegible]  [illegible]  NE*          NE*          NE* 
NO. STM.                [illegible]  [illegible]  [illegible]  [illegible]  NE* 
MONTH                   [illegible]  [illegible]  NE*          NE*          [illegible] 
TIME GMT (hrs)          [illegible]  7.0          NE*          NE*          NE* 
ERR DIS: 24-HR (NMI)    125.7        [illegible] 
ERR DIS: 48-HR (NMI)    247.0        [illegible] 
ERR DIS: 72-HR (NMI)    369.4        [illegible] 

TOTAL VARIANCE EXPLAINED (%)                      [illegible]  [illegible]  [illegible] 

* NE: Was not entered in linear regression 
TABLE III. AUTOCORRELATIONS BETWEEN FORECAST 
ERRORS OF SUCCESSIVE FORECASTS 

TIME LAG (HOURS) 

     0          1.000        1.000        1.000 
     6          [illegible]  .790         .838 
    12          .432         [illegible]  .675 
    18          .291         .432         .476 
    24          [illegible]  .205         [illegible] 
    30          [illegible]  [illegible]  [illegible] 
    36          [illegible]  [illegible]  [illegible] 


TABLE IV. SKEWNESS AND KURTOSIS COEFFICIENTS 
OF ERROR COMPONENTS 

                               NORMAL    24-HR        48-HR        72-HR 
U 
  3rd Moment Skewness Coef.    0.0       [illegible]  [illegible]  .046 
  4th Moment Kurtosis          3.0       4.434*       [illegible]  [illegible] 
V 
  3rd Moment                   0.0       [illegible]  [illegible]  [illegible] 
  4th Moment                   3.0       4.544*       3.881*       [illegible] 

* Significantly different from the Gaussian values. 
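
The coefficients in Table IV are presumably the conventional moment ratios: skewness is the third central moment divided by the 3/2 power of the variance, and kurtosis is the fourth central moment divided by the squared variance, which gives 0 and 3 for a Gaussian distribution. A short sketch under that assumption follows; the exact estimator used in the study is not stated here, and the function name and sample values are illustrative only.

```python
def moment_coefficients(values):
    """Return (skewness, kurtosis) from central moments about the sample mean."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n  # variance
    m3 = sum((v - mean) ** 3 for v in values) / n  # third central moment
    m4 = sum((v - mean) ** 4 for v in values) / n  # fourth central moment
    return m3 / m2 ** 1.5, m4 / m2 ** 2


# Illustrative symmetric sample: skewness is zero by construction.
skewness, kurtosis = moment_coefficients([-2.0, -1.0, 0.0, 1.0, 2.0])
print(skewness, kurtosis)
```

A kurtosis above 3, as in the legible 24-hour entries of the table, indicates heavier tails than the Gaussian, that is, more very large errors than a normal distribution with the same variance would produce.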
TABLE VI. TEST GROUP STATISTICS 

FORECAST                   MEANS                     STD. DEV.               MAGNITUDE OF VECTOR ERROR 
GROUP  CASES        U            V            U            V            MEAN         STD. DEV. 

24-HR 
1      1834         [illegible]  [illegible]  87.9         [illegible]  [illegible]  [illegible] 
2      [illegible]  -16.0        [illegible]  [illegible]  [illegible]  130.4        84.0 
3      1665         6.5          [illegible]  [illegible]  118.4        [illegible]  89.8 

48-HR 
1      1432         [illegible]  4.1          [illegible]  151.4        [illegible]  [illegible] 
2      [illegible]  -19.4        -31.4        [illegible]  [illegible]  [illegible]  [illegible] 
3      873          [illegible]  [illegible]  [illegible]  [illegible]  [illegible]  [illegible] 

72-HR 
1      623          -14.4        [illegible]  [illegible]  235.4        [illegible]  [illegible] 
2      367          -8.1         -70.5        [illegible]  249.3        [illegible]  [illegible] 
3      [illegible]  17.0         27.5         368.4        302.5        418.5        230.6 

TABLE VII. STUDENT'S t VALUES COMPUTED ON 
GROUP MEANS (10 YEAR DATA) 

                   24-HR        48-HR        72-HR 
Group 1 vs. 2      [illegible]  [illegible]  [illegible] 
Group 2 vs. 3      [illegible]  3.48         0.92 
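
A t value of the kind listed in Table VII can be formed from the group means, standard deviations, and case counts of Table VI. The sketch below assumes the unpooled two-sample form of the statistic; the formula actually used in the study is not given in this section, and the numbers in the example are invented rather than read from the tables.

```python
import math


def t_statistic(mean_a, sd_a, n_a, mean_b, sd_b, n_b):
    """Two-sample t value for the difference of group means (unpooled variances)."""
    standard_error = math.sqrt(sd_a ** 2 / n_a + sd_b ** 2 / n_b)
    return (mean_b - mean_a) / standard_error


# Invented summary statistics (n mi) for two forecast groups.
t_value = t_statistic(100.0, 60.0, 400, 112.0, 80.0, 400)
print(round(t_value, 2))
```

With case counts in the hundreds or thousands, a t value above about 2 indicates that the difference between two group means is unlikely to be a sampling accident at the 5% level.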
TABLE VIII. 1976 TEST GROUP STATISTICS 

FORECAST                   MEANS                     STD. DEV.               MAGNITUDE OF VECTOR ERROR 
GROUP  CASES        U            V            U            V            MEAN         STD. DEV. 

24-HR 
1      144          [illegible]  [illegible]  [illegible]  [illegible]  104.6        62.6 
2      240          [illegible]  [illegible]  [illegible]  [illegible]  110.4        [illegible] 
3      140          [illegible]  [illegible]  111.9        122.4        143.3        [illegible] 

48-HR 
1      133          [illegible]  [illegible]  196.5        160.1        [illegible]  [illegible] 
2      202          [illegible]  [illegible]  [illegible]  145.9        [illegible]  121.4 
3      89           [illegible]  [illegible]  214.7        195.6        265.2        [illegible] 

72-HR 
1      109          [illegible]  -35.2        [illegible]  240.1        [illegible]  225.9 
2      164          [illegible]  [illegible]  [illegible]  [illegible]  [illegible]  [illegible] 
3      [illegible]  [illegible]  [illegible]  [illegible]  204.5        [illegible]  [illegible] 


TABLE IX. NORMALIZED DEVIATIONS OF 1976 
GROUP MEANS FROM 10 YEAR GROUP MEANS 

GROUP      24-HR        48-HR        72-HR 
1          0.60         0.50         0.36 
2          [illegible]  [illegible]  [illegible] 
3          [illegible]  [illegible]  [illegible] 
Mean Error Stratifications by Latitude, Longitude. 

Mean Error Stratifications by Maximum Wind. 

Fig. 3a. West-East MOVEMENT    3b. South-North MOVEMENT 
Figure 3. Mean Error Stratifications by West-East, South-North 
Components of Movement. 

Fig. 4a. MONTH    4b. TIME 

Figure 5. Mean Error Stratifications by Number of Storms, 
Distance to the Nearest Storm. 

MEAN ERROR STRATIFICATIONS 
Fig. 6a. POINT ON TRACK    6b. INITIAL POSITION ERROR 
Figure 6. Mean Error Stratifications by Point on Track, 
Initial Position Error. 

Figure 7. Mean Error Stratifications by Year. 


Figure 8a. Error Frequency Distribution - U Component. 
[Plot: error component (NMI and KM) against cumulative frequency (%), 
0.01 to 99.99; curves for the 24, 48, and 72 hour forecasts and a 
normal sample.] 





Figure 8b. Error Frequency Distribution - V Component. 
[Plot: error component (NMI and KM) against cumulative frequency (%), 
0.01 to 99.99; curves for the 24, 48, and 72 hour forecasts and a 
normal sample.] 





Figure 9. Mean Annual Forecast Error Stratifications by Group 
(with Least Squares Linear Trend Lines (---)). 
[Panels: Group 1, Group 2, Group 3, and all groups. Mean forecast 
error (NMI and KM) is plotted by year, with the year axis labeled 66 
through 74 and (76) shown separately, for the 24, 48, and 72 hour 
forecasts.] 


[Fig. 10: one ellipse plot per group, with axes in NMI and KM. The 
legend printed beside each plot is reproduced below.] 

GROUP 1 
HR                   24           48           72 
U                    -6           -15          -15 
V                    3            4            2 
Sx                   90           196          310 
Sy                   74           [illegible]  235 
r                    [illegible]  [illegible]  [illegible] 
DIR                  19°          [illegible]  [illegible] 
SEMI AXES (major)    90           209          339 
SEMI AXES (minor)    74           133          191 
AREA                 20.6         87.3         203.4 

GROUP 2 
HR                   24           48           72 
U                    -16          -19          -8 
V                    -5           -31          -70 
Sx                   119          237          [illegible] 
Sy                   97           182          252 
r                    .14          .26          .34 
DIR                  17°          22°          20° 
SEMI AXES (major)    121          247          386 
SEMI AXES (minor)    94           170          228 
AREA                 35.9         131.9        275.9 

GROUP 3 
HR                   24           48           72 
U                    8            19           17 
V                    -3           16           28 
Sx                   130          259          368 
Sy                   118          223          302 
r                    .20          [illegible]  .41 
DIR                  32°          [illegible]  32° 
SEMI AXES (major)    137          280          405 
SEMI AXES (minor)    110          196          252 
AREA                 47.3         172.1        319.9 

ALL DISTANCES: NMI 
AREAS: THOUSANDS OF SQ. NMI 
DIR: LOCATES MAJOR AXIS (N of E) 
U, V: LOCATE ELLIPSE CENTER 


Figure 10. 40% Probability Ellipses Per Group Per Forecast. 
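
The semi-axes, direction, and area in the Figure 10 legend appear to follow from the component standard deviations and their correlation. For Group 2 at 24 hours (Sx = 119, Sy = 97, r = .14, as read from the legend), the square roots of the eigenvalues of the covariance matrix reproduce the printed semi-axes of 121 and 94, the direction of 17°, and the area of 35.9 thousand sq. nmi. That is the one-standard-deviation ellipse of a bivariate normal distribution, which encloses about 39% of cases, close to the stated 40%. The sketch below rests on that inference rather than on a stated method, and the function name is illustrative.

```python
import math


def ellipse_parameters(sx, sy, r):
    """Semi-axes, major-axis direction (deg N of E), and area of the
    one-standard-deviation ellipse for standard deviations sx, sy and
    correlation r."""
    cov = r * sx * sy
    half_trace = (sx ** 2 + sy ** 2) / 2.0
    root = math.sqrt(((sx ** 2 - sy ** 2) / 2.0) ** 2 + cov ** 2)
    semi_major = math.sqrt(half_trace + root)  # sqrt of larger eigenvalue
    semi_minor = math.sqrt(half_trace - root)  # sqrt of smaller eigenvalue
    direction = 0.5 * math.degrees(math.atan2(2.0 * cov, sx ** 2 - sy ** 2))
    area = math.pi * semi_major * semi_minor
    return semi_major, semi_minor, direction, area


# Group 2, 24-hour values as read from the Figure 10 legend.
semi_major, semi_minor, direction, area = ellipse_parameters(119.0, 97.0, 0.14)
print(round(semi_major), round(semi_minor), round(direction), round(area / 1000.0, 1))
```

Because the legend values are rounded, other group and forecast combinations agree with this calculation only to within a few nautical miles.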


Figure 11. 1976 Verifying Positions that Fell into 25, 50, 
75, 90, and 95% Probability Ellipses by Group 
and Forecast (24, 48, and 72 hours). 
[Panels: Groups 1, 2, and 3, each for the 24, 48, and 72 hour 
forecasts. Observed cases (%) are plotted against expected cases (%), 
with the maximum deviation (observed to expected) marked.] 
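
The comparison in Figure 11 amounts to counting how many verifying positions fall inside each probability ellipse. Assuming bivariate normal error components, a position lies inside the ellipse of probability P when its normalized quadratic distance from the ellipse center does not exceed -2 ln(1 - P). The sketch below applies that test; the function names and the four sample errors are hypothetical, and the procedure used in the study may differ in detail.

```python
import math


def inside_probability_ellipse(du, dv, sx, sy, r, p):
    """True if the offset (du, dv) from the ellipse center lies inside the
    ellipse enclosing probability p of a bivariate normal distribution."""
    zu, zv = du / sx, dv / sy
    q = (zu * zu - 2.0 * r * zu * zv + zv * zv) / (1.0 - r * r)
    return q <= -2.0 * math.log(1.0 - p)


def observed_percent(errors, center, sx, sy, r, p):
    """Percent of (u, v) errors that fall inside the p ellipse."""
    u0, v0 = center
    hits = sum(
        inside_probability_ellipse(u - u0, v - v0, sx, sy, r, p)
        for u, v in errors
    )
    return 100.0 * hits / len(errors)


# Hypothetical errors (n mi) checked against a 50% ellipse.
errors = [(0.0, 0.0), (100.0, 0.0), (0.0, 150.0), (300.0, 300.0)]
percent_inside = observed_percent(errors, (0.0, 0.0), 100.0, 100.0, 0.0, 0.5)
print(percent_inside)
```

Repeating the count for P = 0.25, 0.50, 0.75, 0.90, and 0.95 gives the observed percentages that the figure plots against the expected ones.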


Figure 12. Contrast of Group 1 and Group 3 1976 Verifying 
Positions that Fell into 25, 50, 75, 90, and 95% 
Probability Ellipses for Group 1 and Group 3 by 
Forecast (24, 48, and 72 hours). 
[Panel (a): Group 1 cases in Group 3 ellipses. Panel (b): Group 3 
cases in Group 1 ellipses. Each panel shows the 24, 48, and 72 hour 
forecasts, with observed cases (%) plotted against expected cases (%) 
and the maximum deviation (observed to expected) marked. Sample sizes 
legible in the panels: N = 109, N = 140, and N = 89.] 










INITIAL DISTRIBUTION LIST 

                                                          No. Copies 

Defense Documentation Center                                   2 
Cameron Station 
Alexandria, Virginia 

Library, Code 0142 
Naval Postgraduate School 
Monterey, California 93940 

Dr. G. J. Haltiner, Code 63Ha 
Chairman, Department of Meteorology 
Naval Postgraduate School 
Monterey, California 93940 

[illegible] J. D. Jarrell, Code 35 
Naval Postgraduate School 
Monterey, California 93940 

Mr. Samson Brand 
Naval Environmental Prediction Research Facility 
Naval Postgraduate School 
Monterey, California 93940 

Air Weather Service 
AWVAS/TF 
Scott AFB, Illinois 62225 

Capt. Harry Hughes 
[illegible] 
Wright-Patterson AFB, Ohio 45433 

Director [illegible] Box 17 
FLEWEACEN, COMNAVMAR 
FPO San Francisco 96630 

Dr. W. van der Bijl, Code 63Vb 
Naval Postgraduate School 
Monterey, California 93940 

Capt. J. D. Shewchuk 
Det 1, 1WW 
COMNAVMAR Box 17 
FPO San Francisco 96630 

[illegible] Donald S. Nicklin 
3304 Coffey Avenue 
Omaha, Nebraska 68123 




















Thesis 
N54 
c.1 

Nicklin 
    A statistical analysis of Western Pacific 
    tropical cyclone forecast errors. 


